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Abstract 

To combine the high power efficiency of Continuous Phase Modulation (CPM) with either high spectral efficiency or enhanced 
performance in low Signal to Noise conditions, some authors have proposed to introduce CPM in a MIMO frame, by using 
Space Time Codes (STC). In this paper, we address the code design problem of Space Time Block Codes combined with CPM 
and introduce a new design criterion based on L 2 orthogonality. This L 2 orthogonality condition, with the help of simplifying 
assumption, leads, in the 2x2 case, to a new family of codes. These codes generalize the Wang and Xia code, which was based on 
pointwise orthogonality. Simulations indicate that the new codes achieve full diversity and a slightly better coding gain. Moreover, 
one of the codes can be interpreted as two antennas fed by two conventional CPMs using the same data but with different alphabet 
sets. Inspection of these alphabet sets lead also to a simple explanation of the (small) spectrum broadening of Space Time Coded 
CPM. 



I. Introduction 



Since the pioneer work of Alamouti [1] and Tarokh [2], Space Time Coding has been a fast growing field of research where 
numerous coding schemes have been introduced. Several years later Zhang and Fitz [3], [4] were the first to apply the idea 
of STC to continuous phase modulation (CPM) by constructing trellis codes. In [5] Zajic and Stiiber derived conditions for 
partial response STC-CPM to get full diversity and optimal coding gain. A STC for noncoherent detection based on diagonal 
blocks was introduced by Silvester et al. [6]. 

The first orthogonal STC for CPM for full and partial response was developed by Wang and Xia [7], [8]. The scope of 
this paper is also the design of an orthogonal STC for CPM. But unlike Wang-Xia aprroach [8] which starts from a QAM 
orthogonal Space-Time Code (e.g. Alamouti's scheme [1]) and modify it to achieve continuous phases for the transmitted 
signals, we show here that a more general L 2 condition is sufficient to ensure fast maximum likelihood decoding with full 
diversity. 

In the considered system model (FigJTJ, the data sequence dj is defined over the signal constellation set 



n d = {— M + 1, -M + 3, . . . , M - 3, M - 1} (1) 



• £h ^ or an a lph a bet with log 2 M bits. To obtain the structure for a Space Time Block Code (STBC) this sequence is mapped to data 
'matrices with elements dmr, where m denotes the transmitting antenna, r the time slot into a block and (i) a parameter 



^ . for partial response CPM. The data matrices are then used to modulate the sending matrix 



S(f) 

Each element is defined for (21 + r - 1)T < t < (21 + r)T as 



sii(i) si 2 (t) 

S2l(i) S 2 2(t) 



(2) 



W(*) = yf * a * Mt) (3) 
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Fig. 1. Structure of a MIMO Tx/Rx system 

where E s is the symbol energy and T the symbol time. The phase <p m r(t) is defined in the conventional CPM manner [9] 
with an additional correction factor c mr (t) and is therewith given by 

22+r 

4>mr{t) = 9 m (2l + r )+h^2 dW «(* - (* - !) T ) + <W(<) (4) 
z=22+l+r — f 

where h — 2mo/p with mo and p relative primes is called the modulation index. The phase smoothing function q(t) has to 
be a continuous function with q(t) = for t < and q(t) — 1/2 for t > jT. 

The memory length 7 determines the length of q(t) and affects the spectral compactness. For large 7 we obtain a compact 
spectrum but also a higher number of possible phase states which increases the decoding effort. For full response CPM, we 
have 7 = 1 and for partial response systems 7 > 1. 

The choice of the correction factor c mr (t) in Eq. (0| is along with the mapping of dj to D«, the key element in the design 
of our coding scheme. It will be detailed in Section ITT1 We then define 6 m (2l + r) in a most general way 

9 m {2l + 3) = 9 m (2l + 2)+ £,(21 + 2) 

= 6 m (2l + l)+£(2l + l) + ^(2l + 2). (5) 

The function £(21 + r) will be fully defined from the contribution c mr (t) to the phase memory m (2l + r). For conventional 
CPM system, c mr (t) = and we have £(2Z + 1) = , ^d 2 i + i- 1 . 

The channel coefficients a mn are assumed to be Rayleigh distributed and independent. Each coefficient a mn characterizes 
the fading between the m th transmit (Tx) antenna and the n th receive (Rx) antenna where n = 1, 2, . . . , L r , Furthermore, the 
received signals 

y n (t) = a mn s mr (t) + n(t) (6) 

are corrupted by a complex additive white Gaussian noise n(t) with variance 1/2 per dimension. 

At the receiver, the detection is done on each of the L r received signals separately. Therefore, in general, each code block S(t) 
has to be detected by block. E.g. for a 2x2 block, estimating the symbols dj implies computational complexity proportional to 
M 2 . Now, this complexity can be reduced to 2 M by introducing an orthogonality property as well as simplifying assumptions 
on the code. 

Criteria for such STBC are given in Section [TT] In Section [Till the criteria are used to construct OSTBC for CPM. In Section 
[lV]we test the designed code and compare it with the STC from Wang and Xia [8]. Finally, some conclusions are drawn in 
Section |V] 

II. Design Criteria 

The purpose of the design is to achieve full diversity and a fast maximum likelihood decoding while maintaining the 
continuity of the signal phases. This section shows how the need to perform fast ML decoding leads to the L 2 orthogonality 
condition as well as to simplifying assumptions, which can be combined with the continuity conditions. For convenience we 
only consider one Rx antenna and drop the index n in a mn . 



A. Fast Maximum Likelihood Decoding 

Commonly, due to the trellis structure of CPM, the Viterbi algorithm is used to perform the ML demodulation . On block 
I each state in the trellis has M 2 incoming branches and M 2 outgoing branches with a distance 



(2/+l)T (22+2)T 

2 



di. (7) 



Di = J y(t) - ^2 a m s ml (t) dt + J y(t) - ^ a m s m2 (t) 

2/T m=1 (2/ + l)T m=1 

The number of branches results from the blockwise decoding and the correlation between the sent symbols si r (t) and s 2r (t). 
A way to reduce the number of branches is to structurally decorrelate the signals sent by the two transmitting antennas, i.e. to 
put to zero the inter-antenna correlation 

(2Z+1)T (2/+l)T (2/+2)T (22+2)T 

a 2 a\ J s^^s^^dt + aia 2 Jsn(t)s 21 (t) dt + a 2 alj s 22 (t)sl 2 (t) dt + aia 2 J si 2 (t)s* 22 (t) dt = 0. (8) 

2/T 21T (2/+l)T (2/+l)T 

Pointwise orthogonality as defined in [8] is therefore a sufficient condition but not necessary. A less restrictive L 2 orthogo- 
nality is also sufficient. From Eq. ([HJ, the distance given in Eq. © can then be simplified to 

(2/ + l)T (2/+2)T 

Di= Jfii(t) + f 2 i(t)-\y(t)\ 2 dt+ J / ia (i) + f 22 (t) - \y(t)\ 2 dt (9) 

2/T (2/+l)T 

with f mr {t) = \y{t) — a m s mr (t)\ 2 . When each s mr (t) depends only on d 2 i+i or d 2 \+ 2 the branches can be split and calculated 
separately for d 2 i+i and d 2 i+2- The complexity of the ML decision is reduced to 2M. The complexity for detecting two 
symbols is thus reduced from pM 1+1 to pM 1 . The STC introduced by Wang and Xia [8] didn't take full advantage of the 
orthogonal design since s mr (t) was depending on both d 2 i + i and d 2 i+ 2 . The gain they obtained in [8] was then relying on 
other properties of CPM, e.g. some restrictions put on q(t) and p. These restrictions may also be applied to our design code, 
which would lead to additional complexity reduction. However, this is not in the scope of this paper and is be the subject of 
another upcoming paper. 

B. Orthogonality Condition 

In this section we show how L 2 orthogonality for CPM, i.e. ||S(t)||£ 2 = J 2 % +2)T S(t)S H (t) dt = 21, can be obtained. As 
such, the correlation between the two transmitting antennas per coding block is cancelled if 

(2/+2)T (2/ + l)T (2/+2)T 

J s Xr {t)s* 2r {t)dt= J s n (t)s* 21 (t)dt+ J s 12 (t)s* 22 (t)dt = 0. (10) 

21T 2/T (2/+l)T 

Replacing s mr (t) by the corresponding CPM symbols from Eq. @, we get 

( 2i+1 > T 21+1 21+2 

/ exp{j27r[0i(2Z + 1) + hJ^d+Mt !) T ) + ci,i(*) - H 21 + 1 )~ h J2 d 2°i9(* ~ ( l ~ - c 2.iM] } d *+ 

" — of i o ~, ,' — of i Q 



i-2l+2 — f i=2/+3--y 

(2l+2)T 



21T 

2/+2 21+1 



exp|j27r[6' 1 (2f! + 2) + h^d { ^ 2 q(t - (i - l)T)+ci, 2 (t) - 2 (2Z + 2) - h ^ d { *+ 1] q{t - iT) - c 2 , 2 (t)] dij = 0. (11) 

a — 07 i n „, „- — of i o 



(2l-\-X)T i-2i + 3-7 i = 2Z + 2- 7 



The phase memory 6 m (2l + r) is independent of time and has not to be considered for integration. Using Eq. © to replace 
phase memory 9 m (2l + 2) of the second time slot, we obtain 



C 2i + 1 > T 21 + 1 21+1 



exp {j2w[h Y ~ (* ~ + ci,i(t) - h ^ ~ (* ~ !) T ) ~ C2,i(t)] } dt+ 

2lT i=2l+2 — r i=2l+2-j 

& l +p T 21+1 

exp {j27r [$i(2Z + 1) - ^2(2Z + 1)] } ■ / exp [j2n[h ~ (* ~ + c L2(i 

2^ z— 2Z+2— 7 

2i + l 

^ XI 4'a - (* - ~ ca,2(* + r)] } dt = 0. (12) 

i=2!+2-7 

C. Simplifying assumptions 

To simplify this expression, we factor Eq. ( fT2l into a time independent and a time dependent part. For merging the two 
integrals to one time dependent part, we have to map d^ 2 to and c mr (t) to a different c m > r /(t). Consequently, for the 

(i) 

data symbols dmr there exist three possible ways of mapping: 

• crosswise mapping with d^\ — d 2 \ and d^ \ = d 2 \; 

• repetitive mapping with = d^\ and d 2 \ = d 2 \; 

• parallel mapping with d± \ — d 2 \ and d^\ — d 2 \ . 
The same approach can be applied to c mr (t): 

• crosswise mapping with cu(t) = —C22(t — T) and C\2(t) = — C2i(t — T); 

• repetitive mapping with cn(t) = Cx%(t — T) and C2i(t) = C22(i — T); 

• parallel mapping with Cn(i) = C2i(t) and Ci2(t) = C22(i). 

For each combination of mappings, Eq. ( fT2l is now the product of two factors, one containing the integral and the other a 
time independent part. To fulfill Eq. (fT2l it is sufficient if one factor is zero, namely 1 + eP 2n ^ 1 ^ 2l+1 ^^ 2 ^ 2l+1 " = 0, i.e. if 

k+ \ =fc(2f+l) -6(2/ + 1) (13) 
with fc G N. We thus get a very simple condition which only depends on £ m (2Z + 1). 

D. Continuity of Phase 

In this section we determine the functions £ TO (2/ + 1) to ensure the phase continuity. 

Precisely, the phase of the CPM symbols has to be equal at all intersections of symbols. For an arbitrary block I, it means 
that 4> m i{{2l + 1)T) = 4> m 2((2l + 1)T). Using Eq. ©, it results in 

2i+l 2Z+2 

C m (2l + 1) = i£y® iiq ((2l + 2 - i)T) + c m ,i((2i + 1)T) - fc^d^UpZ + 2 - i)T) - c m2 ((2Z + 1)T). (14) 

i=2i+2-7 i=2i+3-7 

For the second intersection at (2/ + 2)T, since <f> m 2((2l + 2)T) = (j) m \{{2l + 2)T), we get 

2Z+2 2(2 + l) + l 

£ m (2Z + 2) = h Y,d%{{2l + 3 - i)T) + c m2 {(2l + 2)T) - h ^ d£? iX g((2J + 3 - i)T) - c ro ,i((2Z + 2)T)]. (15) 

i=2i+3-7 i=2(2 + l)+2-7 

Now, by choosing one of the mappings detailed in Section [Till these equations can be greatly simplified. Hence, we have 
all the tools to construct our code. 



III. Orthogonal Space Time Codes 
In this section we will have a closer look at two codes constructed under the afore-mentioned conditions. 



A. Existing Code 

As a first example, we will give an alternative construction of the code given by Wang and Xia in [8]. Indeed, the pointwise 
orthogonality condition used by Wang and Xia is a special case of the L 2 orthogonality condition, hence, their ST-code can 
be obtained within our framework. 

(i) 

For the first antenna Wang and Xia use a conventional CPM with = di for i = 2l + r+l— 7, 2/ + r + 2 — 7, . . . , 21 + r 

and c\ r {t) = 0. The symbols of the second antenna are mapped crosswise to the first d 2 \ = — c?i+i for i = 21 + 2 — 7, 21 + 

(i—l) 

3 — 7, . . . , 21 + 1 and d 22 = — <2»-i for i — 21 + 3 — 7, 21 + 4 — 7, . . . , 21 + 2. Using this cross mapping makes it difficult to 
compute £, m (2l + 1) since the CPM typical order of the data symbols is mixed. Wang and Xia circumvent this by introducing 
another correction factor for the second antenna 

7-1 

c 2r (t)=J2(h(d 2 i+i-i + dai+a-i) + l)flb(* - (21 + r - 1 - i)T) (16) 

i=0 

By first computing £ m (2Z + 1) with Eq. dTTb and then Eq. ( fT3l) . we get the L 2 orthogonality of the Wang-Xia-STC. 

B. Parallel Code 

To get a simpler correction factor as in [8], we designed a new code based on the parallel structure which permits to 
maintain the conventional CPM mapping for both antennas. Hence we choose the following mapping: dfy = d^ 2 ^ = di for 
i = 21 + r + 1 - 7, 21 + r + 2 - 7, . . . , 21 + r. Then, Eq. (Qj) and (O can be simplified into 

+ 1) = 7^+2-7 + c m i((2/ + 1)T) - c„ l2 ((2Z + 1)T) (17) 

£ m (2^ + 2) = ^d 2 /+ 3 - 7 + c„ l2 ((2Z + 2)T) - c rnl ((2l + 2)T). (18) 

With this simplified functions, the orthogonality condition only depends on the start and end values of c mr (t), i.e. 

k + i = c u ((2Z + 1)T) - c 12 ((2Z + 1)T) - cai((2Z + 1)T) + c 22 ((2Z + 1)T). (19) 

To merge the two integrals in Eq. (fT2l not only the mapping of dmr is necessary but also an equality between different c rnr (t). 
From the three possible mappings, we choose the repeat mapping because of the possibility to set c mr {t) to zero for one 
antenna. Hence we are able to send a conventional CPM signal on one antenna and a modified one on the second. Using Eq. 
(fT9l and the equalities for the mapping, we can formulate the following condition 

k+\ = c 12 (2lT) - c 12 ((2Z + 1)T) - c 22 (2ZT) + c 22 ((2Z + 1)T). (20) 

With cn(t) = ci 2 (t) = 0, we can take for c 2 i(i) = c 22 (£) any continuous function which is zero at t — and 1/2 at t — T. 
Another possibility is to choose the correction factor of the second antenna with a structure similar to CPM modulation, i.e. 

21+1 

C2r(*)= £ (21) 
1=21 + 1-7 

for (21 + r — 1)T < t < (21 + r)T. With this approach, the correction factors can be included in a classical CPM modulation 
with constant offset of 1/h. This offset may also be expressed as a modified alphabet for the second antenna 

Q d2 ={-M+l + -,-M + 3+-,...,M-3 + -,M-l + -} (22) 
h h h h 

Consequently, this L 2 -orthogonal design may be seen as two conventional CPM designs with different alphabet sets £1^ and 

Q,d2 for each antenna. However, in this method, the constant offset to the phase may cause a shift in frequency. But as shown 

by our simulations in the next section, this shift is quite moderate. 

IV. Simulations 

In this section we verify the proposed algorithm by simulations. Therefore a STC-2REC-CPM-sender with two transmitting 
antennas has been implemented in MATLAB. For the signal of the first antenna we use conventional Gray-coded CPM with 
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Fig. 2. Simulated BER for different numbers of Tx and Rx antennas of the proposed STC and of the Wang-Xia-STC 
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Fig. 3. Simulated psd for each Tx antenna of the proposed STC (continuous line) and the Wang-Xia-STC (dashed line) 

a modulation index h = 1/2, the length of the phase response function 7 = 2 and an alphabet size of M — 8. The signal of 
the second antenna is modulated by a CPM with the same parameters but a different alphabet Qd 2 , corresponding to Eq. d22l) . 

The channel used is a frequency flat Rayleigh fading model with additive white Gaussian noise. The fading coefficients a mn 
are constant for the duration of a code block (block fading) and known at receiver (coherent detection). The received signal 
y n (t) is demodulated by two filterbanks with pM 2 filters, which are used to calculate the correlation between the received 
and candidate signals. Due to the orthogonality of the antennas each filterbank is independently applied to the corresponding 
time slot k of the block code. The correlation is used as metric for the Viterbi algorithm (VA) which has pM states and M 
paths leaving each state. In our simulation, the VA is truncated to a path memory of 10 code blocks, which means that we get 
a decoding delay of 2 • 10T. 

From the simulation results given in Figure |2] we can reasonably assume that the proposed code achieves full diversity. 
Indeed, the curves for the 2x1 and 2x2 systems respectively show a slope of 2 and 4. Moreover, the curve of the 2x1 systems 
follows the same slope as the ST code proposed by Wang and Xia [8], which was proved to have full diversity. Note also that 
the new code provides a slightly better performance. 

A main reason of using CPM for STC is the spectral efficiency. Figure [5] show the simulated power spectral density (psd) 



for both Tx antennas of the proposed ST code (continuous line) and the ST code proposed by Wang and Xia [8]. The first 
antenna of our approach uses a conventional CPM signal and hence shows an equal psd. The spectrum of the second antenna 
is shifted due to adding an offset c mr (t) with a non zero mean. Minimizing the difference between the two spectra by shifting 
one, result in a phase difference of 0.375 measured in normalized frequency / • T4, where Td — T/ log 2 (M) is the bit symbol 
length. The first antenna of the Wang-Xia-algorithm has almost the same psd while the spectrum of the second antenna is 
shifted by approximately 0.56/ • Td- This means that the OSTC by Wang and Xia requires a slightly larger bandwidth than 
our OSTC. 

V. Conclusion 

In applications where the power efficiency is crucial, combination of Continuous Phase Modulation and Space Time Coding 
has the potential to provide high spectral efficiency, thanks to spatial diversity. To address this power efficiency, ST code design 
for CPM has to ensure both low complexity decoding and full diversity. To fulfill these requirements, we have proposed a 
new L 2 orthogonality condition. We have shown that this condition is sufficient to achieve low complexity ML decoding and 
leads, with the help of simplifying assumption to a simple code. Moreover, simulations indicate that the code most probably 
achieves full diversity. Further work will be concentrated on the design of other codes based on L 2 orthogonality as in the 
meanwhile, we have been able to obtain the design of full diversity, full rate L 2 orthogonal codes for 3 antennas [10]. 
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